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Abstract — Spatially coupled codes have been of interest re- 
cently owing to their superior performance over memoryless 
binary-input channels. The performance is good both asymp- 
totically, since the belief propagation thresholds approach the 
Shannon limit, as well as for finite lengths, since degree-2 variable 
nodes that result in high error floors can be completely avoided. 
However, to realize the promised good performance, one needs 
large blocklengths. This in turn implies a large latency and 
decoding complexity. For the memoryless binary erasure channel, 
we consider the decoding of spatially coupled codes through a 
windowed decoder that aims to retain many of the attractive 
features of belief propagation, while trying to reduce complexity 
further. We characterize the performance of this scheme by 
defining thresholds on channel erasure rates that guarantee a 
target erasure rate. We give analytical lower bounds on these 
thresholds and show that the performance approaches that of 
belief propagation exponentially fast in the window size. We 
give numerical results including the thresholds computed using 
density evolution and the erasure rate curves for finite-length 
spatially coupled codes. 

Index Terms — Low-density parity-check codes, Belief propa- 
gation, Erasure channels, Spatial coupling, Windowed decoding, 
Iterative decoding. 



I. Introduction 

SPARSE graph codes have been of great interest in the 
coding community for close to two decades, after it was 
shown that statistical inference techniques on graphical models 
representing these codes had decoding performance that sur- 
passed that of the best known codes. One class of such codes 
are low-density parity-check (LDPC) codes, which although 
introduced by Gallager in the 60's [2] were rediscovered in the 
90's after the advent of Turbo Codes 1 3 1 and iterative decoding. 
Luby et al. showed |4|, [5] that a decoder based on belief 
propagation (BP) |6| had very good performance for these 
codes over the binary erasure channel (BEC). This superior 
performance of LDPC codes was shown by Richardson and 
Urbanke (7| to be true over a broader class of binary-input, 
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memoryless, symmetric-output (BMS) channels. Furthermore 
these codes were optimized to approach capacity on many of 
these BMS channels (8), @- 

The convolutional counterparts of LDPC block codes were 
first introduced by Felstrom and Zigangirov in pO) . There 
is considerable literature on the constructions and analysis 
of these ensembles (TTJ-|[T4). The BP thresholds for these 
ensembles were reported in [15] and shown to be close to 
capacity in fl6) . In p7| the authors construct regular LDPC 
convolutional codes based on protographs [18] that have BP 
thresholds close to capacity. In p9) , Kudekar et al. considered 
convolutional-like codes which they called spatially coupled 
codes and showed that the BP thresholds of these codes 
approached the MAP thresholds of the underlying unstructured 
ensembles over the BEC. This observation was made for 
protograph-based generalized LDPC codes in [20]. Evidence 
for similar results over general BMS channels was given in 
pT[ , and proven recently in p2[ . Moreover this phenomenon, 
termed threshold saturation, was shown to be a more generic 
effect of coupling by showing an improvement in performance 
of systems based on other graphical models: the random 
if-SAT, Q-COL problems from computation theory, Curie- 
Weiss model from statistical mechanics |23|, and LDGM 
and rateless code ensembles p4| . Non-binary LDPC codes 
obtained through coupling have also recently been investigated 



The good performance of spatially coupled codes is appar- 
ent when both the blocklength of individual codes and the 
coupling length becomes large. However, as either of these 
parameters becomes large, BP decoding becomes complex. We 
therefore consider a windowed decoder that exploits the struc- 
ture of the coupled codes to reduce the decoding complexity 
while maintaining the advantages of the BP decoder in terms 
of performance. An additional advantage of the windowed 
decoder is the reduced latency of decoding. The windowed 
decoding scheme studied here is the one used to evaluate the 
performance of protograph-based codes over erasure channels 
with and without memory J2"6)-J2"8). The main result of this 
paper is that the windowed decoding thresholds approach the 
BP thresholds exponentially in the size of the window W . 
Since the BP thresholds are themselves close to the MAP 
thresholds for spatially coupled codes, windowed decoding 
thus gives us a way to achieve close to ML performance with 
complexity reduced further beyond that of the BP decoder. 

The rest of the paper is organized as follows. Section [TT] 
gives a brief introduction to spatially coupled codes. In Section 
lllllwe discuss the windowed decoding scheme. We state here 



the main result of the paper which we prove in Section IV 



We give some numerical results in Section [V] and conclude in 
Section VI Much of the terminology and notation used in the 
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paper is reminiscent of the definitions in 
refer the reader to this paper. 



II. Spatially Coupled Codes 

We describe the [di,d r ,J, L) spatially coupled ensemble 
that was introduced in fl9) in terms of its Tanner graph. There 
are M variable nodes at each position in [L] = {1, 2, • • • , L}. 
We will assume that there are check nodes at every 

integer position, but only some of these interact with the 
variable nodes. The variable (check) nodes at position i 
constitute the i th section of variable (check, resp.) nodes in the 
code. The L sections of variables are together referred to as the 
chain and L is called the chain length. For each of the di edges 
incident on a variable at position i, we first choose a section 
uniformly at random from the set {i, i + 1, • • • , i + 7— 1}, then 
choose a check uniformly at random from the M^r- checks in 
the chosen section, and connect the variable to this check. We 
refer to the parameter 7 as the coupling length. It can be shown 
that this procedure amounts roughly to choosing each of the 
d r connections of a check node at position i uniformly and 
independently from the set {i— 7+1, i— 7+2, • • • , i). Observe 
that when 7 = 1 this procedure gives us L copies of the 
(di, d r ) -regular uncoupled ensemble. Since we are interested 
in coupled ensembles, we will henceforth assume that 7 > 1. 
Further, we will typically be concerned with this ensemble 
when L 7, in which case the design rate given by p9| 

R(d l ,d r , 1 ,L) = + 0(- 7 ' 



L 



is close to 1 — 



BP Performance 

In the following we will briefly state known results that are 
relevant to this work. See [19| for detailed analysis of the BP 
performance of spatially coupled codes. The BP performance 
of the (di, dr, 7, L) spatially coupled ensemble when M —> 00 
can be evaluated using density evolution. Denote the average 
erasure probability of a message from a variable node at 
position i as x^. We refer to the vector x = (xi, x^ ■ ■ ■ ,Xi) 
as the constellation. 

Definition 1 (BP Forward Density Evolution): Consider 
the BP decoding of a (di, d r , 7, L) spatially coupled code 
over a BEC with channel erasure rate e. We can write the 
forward density evolution (DE) equation as follows. Set the 
initial constellation to be a;' -* = (1, 1, • • • ,1) and evaluate 
the constellations according to 



0, if i [L] V I, and otherwise 



1 _ I 'S^'>'~ 1 |'1 — I X^T -1 



i+j-k 



di-l 



(1) 

This is called the parallel schedule of the BP forward density 
evolution. □ 



For ease of notation, we will write the equation 



fe=0 



dl-1 



-7+1' ' ' 



(2) 



It is clear that the function g(-) is monotonic in each of its 
arguments. 

Definition 2 (FP of BP Forward DE): Consider the paral- 
lel schedule of the BP forward DE for the (di,d r ,^,L) 
spatially coupled code over a BEC with erasure rate e. It can 
be easily seen from the monotonicity of <?(•) in Equation |2]) 
that the sequence of constellations {x^>}fL Q are ordered as 
rffi y a;^ 1 ) V £ > 0, i.e., xf ] > x[ e+1) V I > 0, i G [L] 
(the ordering is pointwise). Since the constellations are all 
lower bounded by the all-zero constellation 0, the sequence 
converges pointwise to a limiting constellation x^°°\ called 
the fixed point (FP) of the forward DE. □ 

It is clear that the FP of forward DE .t^°°^ satisfies 



„(<*>) 




(00) 



%i[L\ 

1), *e[L]. 



Definition 3 (BP Threshold): Consider the parallel sched- 
ule of the BP forward DE for the (di,d r ,j,L) spatially 
coupled code over a BEC with erasure rate e. The BP threshold 
e BP (di,d r ,j,L) is defined as the supremum of the channel 
erasure rates e € [0, 1] for which the FP of forward DE is the 
all-zero constellation, i.e., a;*- 00 ) = 0. □ 

Table [I] gives the BP thresholds evaluated from BP forward 
DE for the (di = 3, d r — 6, 7, L) coupled ensemble for a few 
values of 7 and L rounded to the sixth decimal place. The 



L\ 7 


2 


3 


4 


5 


6 


16 


0.488079 


0.488220 


0.489806 


0.495671 


0.505866 


32 


0.488079 


0.488150 


0.488151 


0.488164 


0.488294 


64 


0.488078 


0.488145 


0.488148 


0.488149 


0.488150 


128 


0.488075 


0.488137 


0.488142 


0.488144 


0.488146 



TABLE I 

BP Thresholds e BP (di = 3, d r = 6, 7, L) . 



MAP threshold of the underlying (di, rf r )-regular ensemble is 
e MAP (d( = 3, d r = 6) « 0.488151. We see from the table that 
the BP thresholds for (di, d r , 7, L) spatially coupled codes are 
close to the MAP threshold of the [di, d r ) -regular unstructured 
code ensemble. Note that some of the threshold values in 
Table [I] are larger than the MAP threshold of the underlying 
(di, d r )-regular ensemble. This is because the rates of the 
spatially coupled ensembles are smaller than the (di,d r )- 
regular ensemble, and depend on the values of 7 and L as 
stated in the beginning of this section. 

It was shown in |19| that the BP thresholds satisfy 

lim lim e BF (d h d r ,j,L) = lim lim e MAP (d h d r ,j, L) 

7— >-oo L— >oo 7—^00 L— >oo 



MAP 



(di,d r ). 



This means that the BP threshold saturates to the MAP 
threshold, and we can obtain MAP performance with the 
reduced complexity of the BP decoder. Later when we analyze 
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the windowed decoder, we will want to keep the coupling 
length 7 finite and hence will be concerned with the quantity 



e BP (cM r , 7 )^ lim e BP (cM r , 7 ,L) 

L—too 



(3) 



as a measure of the performance of the BP decoder. It 
immediately follows from fl9] Theorem 10] that 



.BP 



{di,d r ,~/) < e (di,d r ) 



III. Windowed Decoding 

The windowed decoder (WD) exploits the structure of 
the spatially coupled codes to break down the BP decoding 
scheme into a series of sub-optimal decoding steps — we trade- 
off the performance of the decoder for reduced complexity 
and decoding latency. When decoding with a window of size 
W, the WD performs BP over the subcode consisting of the 
first W sections of the variable nodes and their neighboring 
check nodes and attempts to decode a subset of symbols (those 
in the first section) within the window. These symbols that 
we attempt to decode within a window are referred to as the 
targeted symbols. Upon successful decoding of the targeted 
symbols (or when a maximum number of iterations have been 
performed) the window slides over one section and performs 
BP, attempting to decode the targeted symbols in the window 
in the new position. 

More formally, let x be the constellation representing the 
average erasure probability of messages from variables in each 
of the sections 1 through L. Initially, the window consists only 
of the first W sections in the chain. We will refer to this as 
the first window configuration, and as the window slides to the 
right, we will increment the window configuration. In other 
words, when the window has slid through (c — 1) sections to 
the right (when it consists of sections c, c+1, • • • , c+ W — 1), 
it is said to be in the c th window configuration. The c th window 
constellation, denoted U^ c y is tne average erasure probability 
of the variables in the c th window configuration. Thus, 

V {c} = (2/l,{c})2/2,{c}>- • • ,VW,{c}) = (x c ,x c+1 , ■ ■ ■ ,X c+W -l) 

for c G [L], where we assume that x c — V c > L. Thus 
the c th window constellation, 2/r c i> represents the "active" 
sections within the constellation x. While referring to the 
entire constellation after the action of the c th window, we will 
write X{ c y- When the window configuration being considered 
is clear from the context, with some abuse of notation, we 
drop the {c} from the notation and write y = (yi,- ■ ■ ,yw) 
to denote the window constellation. 

Remark 1 (Note on notation): When we wish to emphasize 
the size of the window when we write the constellation, we 
write y {w} = (y l: (w),y2,(w), - ■ ■ ,Vw,(W))- Note that the 
window configuration and the window size are specified as 
subscripts within curly brackets {■} and angle brackets (•), 
respectively. Finally, when the constellation after a partic- 
ular number of iterations £ of DE is to be specified, we 
write y^> = (y[ e \ y^\ ■ • • , 2/w0> where the iteration number 
appears as a superscript within parentheses (•). Although 
(w) would b e tne m °st general way of specifying the 



window constellation for the c th window configuration with a 
window of size W after I iterations of DE, for notational 
convenience we will write as few of these parameters as 
possible based on the relevance to the discussion. □ 

A. Complexity and Latency 

For the BP decoder, the number of iterations required to 
decode all the symbols in a (di,d r ,j,L) spatially coupled 
code depends on the channel erasure rate e. Whereas when e e 
[0, e BP (di, d r )] this required number of iterations can be fixed 
to a constant number, when e g (e BP (<i;, d r ), e BP (di, d r , 7, L)] 
the number of iterations scales as O(L) (29]. Therefore, in the 
waterfall region, the complexity of the BP decoder scales as 
0(ML 2 ). For the WD of size W, if we let the number of 
iterations performed scale as 0(W), the overall complexity is 
of the order 0(MW 2 L). Thus, for small window sizes W < 
\[Tj, we see that the complexity of the decoder can be reduced. 
A larger reduction in the complexity is possible if we fix the 
number of iterations performed within each window. 

Another advantage of using the WD is that the decoder 
only needs to know the symbols in the first W sections of the 
code to be able to decode the targeted symbols. Therefore, in 
latency-constrained applications, the decoder can work on-the- 
fly, resulting in a latency which is a fraction ^ that of the BP 
decoder. 



B. Asymptotic Performance 

The asymptotic performance of the (di,d r ,j,L) spatially 
coupled ensemble with WD can be analyzed using density 
evolution as was done for the BP decoder. We will consider 
the performance of the ensemble with M — > 00 when the 
transmission happens over a BEC with channel erasure rate 
e 6 [0, 1]. Further we will assume that for each window con- 
figuration, infinite rounds of message passing are performed. 

Definition 4 (WD Forward Density Evolution): Consider 
the WD of a (di, d r , 7, L) spatially coupled code over a BEC 
with channel erasure rate e with a window of size W. We 
can write the forward DE equation as follows. Set the initial 
constellation x^ according to 



1. 



,{0} 



e[L] 
0, i$[L]. 



For every window configuration c = 1, 2, • • • , L, let 

(°) 1 \ 

Hie} = \ x c,{c-l}iX c +l,{c-l}, ■ ■ ■ ,%c+W-l,{c-l}) 

and evaluate the sequence of window constellations {y^}f^ 1 
using the update rule 

where for every I, for indices i ^ [W], we set yf\ c \ = 
x c+i-i,{c-i} on me right hand side. Then set x^ c y as 



,{c} 



x i,{c-l}> i 7^ c 

„(°°) 



□ 
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Discussion : Note that the constellation x^ keeps track of 
the erasure probabilities of targeted symbols of all window 
configurations up to the c th , followed by erasure probability 
of 1 for the variables in sections c+1 through L, and zeros for 
sections outside this range. As defined, X{ c j discards all in- 
formation obtained by running the WD in its c th configuration 
apart from the values corresponding to the targeted symbols. 
In practice, it is more efficient to define 



H,{c} 



%i,{c— 1} i 
,(oo) 



i £ {c, c+ 1, • 
otherwise. 



W-l} 



^Vi-c+l,{c}> 

In the sequel, we will stick to Definition [4] We do this for 
two reasons: first, discarding some information between two 
window configurations can only perform worse than retaining 
all the information; and second, this assumption makes the 
analysis simpler since we then have yf*\ = 1 V c € [L]. □ 

Definition [4] implicitly assumes that the limiting window 
constellations yvy exist. The following guarantees that the 
updates for x i { c j are well-defined. 

Lemma 5 ( c' h Window Configuration FP of FDE): 
Consider the WD forward DE (FDE) of a (di,d r ,^,L) 
spatially coupled code over a BEC with erasure rate e with a 
window of size W. Then the limiting window constellation 
exists for each c G [L], We refer to this constellation as 
the c th window configuration FP of forward DE. 



Proof: As noted earlier, y?\ = 1 V c G [L], and 



(o) 

m 



1 >- e >- 



yf}\- By induction, from the monotonicity 



of g(-), this implies that y { ^ } >z yft X) V £ > 0. Since 
these constellations are lower bounded by 0, the c th window 
configuration FP of FDE y^^ exists for every c € [L], ■ 

The c th window configuration FP of forward DE therefore 
satisfies 



(oo) 



•Ec+i— l,{c— 1} 5 
(oo) 



for every c € [L]. Since the X{ } vector has non-zero values by 
definition, from the continuity of the WD FDE equations, so do 
the vectors x^ c \ V c. Hence cannot satisfy Equation Q, i.e., 
cannot be the c th window configuration FP of forward DE. 
Therefore, y^ >- V c € [L]. This means that WD can never 
reduce the erasure probability of the symbols of a spatially 
coupled code to zero, although it can be made arbitrarily small 
by using a large enough window. Therefore, an acceptable 
target erasure rate 8 forms a part of the description of the 
WD. We say that the WD is successful when Xi L \ ^ 8. 

Lemma 6 (Maximum of Xr L y): The vector x^l} obtained 
at the end of WD forward DE satisfies a^_i/£} < 
x i,{L} Vi € [L - W — 7 + 2], Moreover, 3 x G [0,1] 
independent of L such that ^i,{z,} <iVi. 

Proof: By definition, = Viux V i G [L]. The claim 

is true for i = 1 since = !/{m > = £q,{l}- F° r me 



(4) 



i th window configuration, it is clear from Definition [4] that 



V{i-i} 2 



J/S , i € [£ — W — 7 + 2] . By induction, from the 

monotonicity of <?(-), it follows that y\u_i\ < f° r * m 

this range. 

For i > L — W — 7 + 2, the above claim is not valid 
because we defined Xj { c j = for j > L and we cannot 
make use of the monotonicity of g(-) since some arguments 
(corresponding to sections up to the L th section) are increasing 
and others (corresponding to the sections beyond the L th 
section) decreasing. Nevertheless, we can still claim that 



%i,{L} < a;»,{oo} 



V i G N where is the vector of erasure 



probabilities obtained after WD for a spatially coupled code 
with an infinite chain length, i.e., L = 00. For L — 00, the 
sequence {2^,(00} } is non-decreasing and since the are 
probabilities, they are in the bounded, closed interval [0, 1]. 
Consequently, the limit lirnj_ i . 00 ^{oo} 



exists in the interval 



[0,1], and lirrii. 



,{00} = SUPj Xj^oo} 



x. 



As a consequence of Lemma [6] we can say that the WD is 
successful when x < 8. This definition of the success of WD is 
independent of the chain length L and allows us to compare 
the performance of WD to that of the BP decoder through 
the thresholds defined in Equation d3j. Note that although the 
upper bound for x in Lemma [6] is a trivial bound, we will 
in the following give conditions when x can be made smaller 
than an arbitrarily chosen 5, thereby characterizing the WD 
thresholds. 

Definition 7 (WD Thresholds): Consider the WD of a 
(di,d r ,j,L) spatially coupled code over a BEC of era- 
sure rate e with a window of size W. The WD threshold 
e WD (dz, d r , 7, W, S) is defined as the supremum of channel 
erasure rates e for which x < 8. □ 



Discussion : Since we defined the WD threshold based on 
x, it is clear that this is independent of the chain length L. 
On the other hand, if we used maXj e rj,i x^ six < S as the 
condition for success of the WD in our definition, we would 
obtain an i-dependent threshold. But x denotes the "worst- 
case" remanant erasure probability after WD, and imposing 
constraints on x therefore guarantees good performance for 
codes with any L. 

Note that keeping x < 8 is sufficient to guarantee an a- 
posteriori erasure probability p e smaller than 8 because 



/x\37+r /x\37^t 



< x < 8. 



□ 



We will now state the main result in this paper and prove 
it in the following section. 

Theorem 8 (WD Threshold Bound): Consider windowed 
decoding of the (di, d r , 7, L) spatially coupled ensemble over 
the binary erasure channel. Then for a target erasure rate 
8 < (5*, there exists a positive integer W / m i n (<5) such that 
when the window size W > W m i n (8) the WD threshold 
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satisfies 



WD 



(d ; ,d r , 7 ,M/,<5)>(l-^^T) 
x (e^{d u d ril )-e-^ 



-Alnln g-C 



(5) 



Here A, B, C, D and are strictly positive constants that 
depend only on the ensemble parameters di , d r and 7. ■ 

Theoremj8] says that the WD thresholds approach the BP 
threshold e (dj, d r ,j) defined in Equation ([3]) at least expo- 
nentially fast in the ratio of the size of the window W to the 
coupling length 7 for a fixed target erasure probability 6 < 5*. 
Moreover, the sensitivity of the bound to changes in 5 is small 
in the exponent in |5]l owing to the log log i factor, but larger 
in the first term in the product on the right hand side of |5]) 
where it is roughly linear in 6. However, since we intend to set 
5 to be very small, e.g. 10~ 15 , the first term does not influence 
the bound heavily. The requirement that W > W m i n (5) is 
necessary to keep the term within parentheses in the exponent 
non-negative. Therefore the minimum window size required, 
W m { n (6), also depends on the constants A, C and D and, in 
turn, on the ensemble parameters di , d r and 7. 

The bound guaranteed by Theorem[8]is actually fairly loose. 
Numerical results suggest that the minimum window size 
W / min(^) ls actually much smaller than the bound obtained 



from analysis (cf. Section IV I. Density evolution also reveals 



that for a fixed window size, the WD thresholds are much 
closer to the BP threshold than the bound obtained from 
Theorem [8] 

We note here that the gap between analytical results and 
numerical experiments is mainly due to the reliance on bound- 
ing the density evolution function in Equation (|2]i using the 
counterpart for regular unstructured LDPC ensembles, which 
proves to be easier to handle than the multivariate Equation 
(|2) (See, e.g., the bound in Hi. However, the scaling of the 
WD thresholds with the window size and the target erasure 
probability seem to be as dictated by the bound in d3), 
suggesting that Theorem [8] captures the essence of the WD 
algorithm. 

Table gives the WD thresholds obtained through forward 
DE for the (di = 3,d r — 6,7 = 3,L) spatially coupled 
ensemble for different target erasure rates 5 and different 
window sizes W. These thresholds have been rounded to the 
sixth decimal point. A few comments are in order. As can 



w\s 


icr b 




10 -l» 


4 


0.068403 


0.000772 


0.000008 


8 


0.472992 


0.390749 


0.254339 


16 


0.487504 


0.487504 


0.487504 



TABLE II 

WD THRESHOLDS e WD (d; = 3,d r = 6,7 = 3, W,S). 



be seen from the table, the thresholds are close to e BP (d; = 
3, d r — 6,7 = 3) « 0.488137 even for window sizes that 
are much smaller than the W m i n (5) obtained analytically, e.g., 
W = 16. Moreover, the WD thresholds are more sensitive 
to changes in 6 for small window sizes where the bound 
in Theorem [8] is not valid. It is obvious that the thresholds 
decrease as i5 is decreased. Also note that for a fixed target 



erasure rate, the window size can be made large enough to 
make the WD thresholds close to the BP threshold. 

IV. Performance Analysis 

In this section, we prove Theorem [8] in steps. First, we 
analyze the performance of the first window configuration. We 
will characterize the first window configuration FP of forward 
DE. We will establish that for the variables in the first section 
of the window, the FP erasure probability can be made small at 
least double-exponentially in the size of the window. We will 
show that this is possible for all channel erasure rates smaller 
than a certain e, which we will call the first window threshold 
e FW (di,d r ,j,W,S), provided the window size is larger than 
a certain minimum size. 

Once we have this, we consider the performance of the c th 
window configuration for 1 < c < L. In this case also, we will 
show that the FP erasure probability of the first section within 
the window is guaranteed to decay double-exponentially in the 
window size. As for the first window configuration, this result 
holds provided the window size is larger than a certain minimal 
size and this time the minimal size is slightly larger than 
the minimal size required for the first window configuration. 
Moreover, such a result is true for channel erasure rates smaller 
than a value which is itself smaller than the first window 
threshold, and this value will be our lower bound for the WD 
threshold. 

A. First Window Configuration 

From Definition |4l forward DE for the first window config- 

(0) 

uration amounts to the following. Set y)^. = 1 and evaluate 

the sequence of window constellations {y^}^ 1 according 
to 



0, 



7+1, {1}' 



1. 



i < 



Since 



(0) 



i > W. 



V i, so 



is non-decreasing, i.e., y\{ 1} s 

is the first window configuration FP, y^j, by induction and 
mono tonicity of </(•). 

Fig. [T] shows the first window configuration FP of forward 
DE for the (di = 3, d r = 6, 7 = 3, L) ensemble with a window 
of size W — 16 for a channel erasure rate e = 0.48812. 

The scheduling scheme used in the definition of the window 
configuration FPs is what is called the parallel schedule. In 
general, we can consider a scheduling scheme where, in each 
step, a subset of the sections within the window are updated. 
We say that such an arbitrary scheduling scheme is admissible 
if every section is updated infinitely often with the correct 
boundary conditions, i.e., with the correct values set at the 
left and the right ends of the window. It is easy to see from 
the standard argument of nested computation trees (see, e.g., 
(T91) that the FP is independent of the scheduling scheme. 
We know that the first window configuration FP of forward 

DE ' Mjrr is non-decreasing, i.e., V^Jx < fi+i^ij V ^ 
following shows the ordering of the FP values of individual 
sections in windows of different sizes. With the understanding 
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1 1 







1 2 3 4 5 6 7 



9 10 11 12 13 14 15 16 



Fig. 1, The first window configuration FP of forward DE for the (di = 
3,d r = 6,7 = 3, L) ensemble with a window of size W = 16 for e = 
0.48812. The left and the right boundaries are fixed at and 1 respectively. 
The sections within the window are indexed from 1 to W = 16. The first 



section has a FP erasure probability j/J°?|, 



2 X 10" 



that we are considering only the first window configuration in 
this subsection, we will drop the window configuration number 
from the notation for window constellations thoughout this 
subsection for convenience. 

Lemma 9 (FPs and Window Size): Let y^ w ^ and y^ w+1 ^ 
denote the first window configuration FPs of forward DE with 
windows of sizes W and W + 1 respectively for e £ [0, 1]. 
Then, 

Vi,(W) > Vi,(W+l) > Vi-l,(W) 

where Uijws denotes the FP erasure probability of the i th 
section in a window of size W. 



Proof: Consider the following schedule. Set y 



(0) 

(W+l) 



{y, w y 1) and evaluate the sequence of window constellations 
{yjv^+i)} according to Equation (|6j. Clearly, we have 



'-(W+l] 



^(y {wy e)^(y {wy 



- J°) 



1) = !/ 



(W+l) 



so that the sequence is pointwise non-increasing 

by induction. We claim that this schedule is admissible. This 
is true because the DE updates are first performed infinitely 
many times over the first W sections to obtain y^ +1 y and 
then over all the W + 1 sections infinitely many times again. 
Therefore the updates are performed over all sections infinitely 
often with the correct boundary conditions. The limiting FP 
must hence be exactly y, w+1 i and the first inequality in the 
statement of the lemma holds. Intuitively, this is true because 
in going from W to W + 1 and checking the i th section, we 
have moved further away from the right end of the window 
(where yi = 1) while remaining at the same distance from the 
left end (where yi =0). 

To prove the second inequality, consider the following 



schedule. Set y 



(0) 

yi,{w) 



, W and 



evaluate the sequence of constellations according to 

Equation |6]). Since y Q i W \ 



(W}< 

< y h{w+iy we must have 

yfw) — yfw) anc ^ induction the sequence of constellations 
thus obtained is also pointwise non-increasing. Again we claim 
that the above mentioned schedule is admissible. This is true 
because we first update all W sections within the window 
and also the zeroth section infinitely often, and then set 
the boundary condition that the zeroth section also has all 



variables completely known. In all, every section within the 
window gets updated infinitely often with the correct boundary 
conditions. The limiting FP must hence be exactly V^ w ^ an d 
the second inequality claimed in the statement of the lemma 
follows. As in the previous case, this is intuitively true because 
in going from the (i + l) th section with window size W + l 
to the i th section with window size W, we have moved closer 
to the left end of the window while maintaining the distance 
from the right end. ■ 

We now give some bounds on the FP erasure probabilities 
of individual sections within a window. 

Lemma 10 (Bounds on FP): Consider the WD of the 
(di, d r ,j, L) ensemble with a window of size W over a 
channel with erasure rate e and di > 3. The first window 
configuration FP y satisfies 



> 



7 . 



1 



di-l 



2" 



y%+j 



Jfc<e(l-afc(l-j/ <+ fc) dr " 1 ) 



d,-i 



for i e [l,W],j G [0,W + 1 - i],k e [0,7 - 1], where 
Q _ (7-fc-l)(7-fc) g 



We relegate the proof to Appendix [I] The following shows 
that once the FP erasure probability of a section within the 
window is smaller than a certain value, it decays very quickly 
as we move further to the left in the window. 

Lemma 11 (Doubly-Exponential Tail of the FP): Consider 
WD of the (di,d r ,j,L) ensemble with a window of size W 
over a channel with erasure rate e € (0, 1). Let di > 3 and let 
y be the first window configuration FP of forward DE. If there 



exists an i 6 [W] such that yi < So = [ (d r — l) d ' 
then 



where ^ = 5 e d i- 2 and tp = ln(J^) 



'j t"i- aim w = = ^ ln 7 > °' 

Proof: Since the FP is non-decreasing, we have 



2/i-( 7 -i) = £ 5(?/i-2(7-i), •• ' ,Vi) 

< tg(yi,yi, ■■■ ,Vi) 

^eil-il-y^- 1 ) 11 '- 1 

< e((d r ~ l)^)*- 1 

which can be applied recursively to obtain 

(d| _i)i_i 



(7) 



(d,-i)j-i 
< e 3T=3 



(8) 



where ^ and ip are as defined in the statement. It is worthwhile 
to note that Sq is a lower bound on the breakout value 
for the (di, d r ) -regular ensemble p0| . The emergence of 
the breakout value in this context is not entirely unexpected 
since it is known that for the (di, d r ) -regular ensemble, the 
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erasure probability decays double-exponentially in the number 
of iterations below the breakout value, and in case of spatially 
coupled ensembles, the counterpart for the number of iterations 
is the number of sections (cf. Equation |7])). ■ 
We now show that the FP erasure probability of a message 
from a variable node in the first section, y\, can be made small 
by increasing the window size W for any e < e BP (di, d r , 7). 
Assuming that the window size is "large enough," we will 
count the number of sections, starting from the right, that have 
a FP erasure probability larger than a small 8 for a channel 



we have 



erasure rate e 



(dj,d r ,7) - Ae. 



Definition 12 (Transition Width): Consider WD of a 
(di, dr,"f, L) spatially coupled code over a BEC of erasure 
rate e. Let y be the 1 st window configuration FP of forward 
DE. Then we define the transition width r(e, 8) of y as 



-(e,S) = \{ie[W]:5< yi <l}\. 



a 



Note that from the definition of the transition width, it 
depends on the window size W. We first upper bound 
T (e, 8) < f(e, 8) such that the upper bound is independent 
of the window size W, and then claim from Lemma [9] that by 
employing a window whose size is larger than f (e, 8), we can 
guarantee yi < 8. 

Definition 13 (First Window Threshold): Consider WD of 
the (di, d r ,7, L) spatially coupled ensemble with a window 
of size W over a BEC with eraure rate e. The first window 
threshold e FW (d;, d r , 7, W, 8) is defined as the supremum of 
channel erasure rates for which the first window configuration 
FP of forward DE y satisfies yi < 5. □ 

From Definitions [12] and [13] we can see that by ensuring that 

W > f (e, 8), we can bound e FW (d h d r , 7, W, 8) > e. 

Proposition 14 (Maximum Transition Width): Consider the 
first window configuration FP of forward DE y for the 
(di,d r ,"f, L) spatially coupled ensemble with a window of size 

W < L for e € [ ^^^ , e BP (d h d r , 7)) = S. 
Then, 

r(e,S) < (7-l)(Alnln^ + Bln^- + c) =f(e,<5) 

provided 6 < 8 . Here Ae = e BP (d h d r , 7) - e, and A, B, C, D 
and Sq are strictly positive constants that depend only on the 
ensemble parameters di , d r and 7. ■ 

The proof is given in Appendix [II] This means that the 
smallest window size that guarantees yi < 8 for a channel 
erasure rate v ' ' '' v " rl is 



(S) = (7-l)fAlnln^ + Bln-^ + C N ) I +1 
L V d Ae mav /J 



> r 



.BP 



(di,d r ,j) 



,BP 



(di,d r ) 



where Ae r 



When W > ^((i), 



FW 



(d l ,d r , 1 ,W,S)>e BP (d l ,d ril ) 



1/ H 
— g BV 7 ~ 



-A In In g- 



(9) 



Discussion : We restricted e e S in Proposition 14 to obtain 
constants that are independent of e. As can be seen from 
the proof of the proposition, these constants are dependent 
on e, unless each is optimized in the range S . As we let 
the minimum e in $ approach e BP (di,d r ), the constants in 
the expression for f(e,S) blow up and the upper bound will 
be useless. It is therefore necessary to keep the minimum of 
e € & strictly larger than e BP (di,d r ) and the value chosen 
in the above was motivated by our intent to ensure that the 
first window threshold was closer to e BP (d;, d r , 7) than to 
e BP (di,d r ). Note that the increase in the upper bound for 
r(e, S) with decrease in e is purely an artifact of the upper 
bounding technique we have employed; i.e., it is obvious that 
as we decrease e, r(e, 5) also decreases. □ 

B. c' h Window Configuration, 1 < c < L 

We now evaluate the performance of the windowed de- 
coding scheme when the window has slid certain number of 
sections from the left end of the code. We arrive at conditions 
under which x is guaranteed to be smaller than 8 while 
operating with a window of size W. We start by establishing 
a property of x. 

Lemma 15 (FP Equation Involving x): Consider the func- 
tion fl(y) where 



fi(i«) 




where 



i < 1 

^{Vi+i-l)), i£[W] 
i > W 

i < 1 
i G [W] 
i > W. 



Then there exists a solution lu to the equation y = Q(y) such 
that oji = x. Moreover, lu_ is the smallest such constellation, 
i.e., if Q_ — tnen Q. t: w. 

Proof: We have 







^OO , {00} 

,.(«") 



(00) 

y i,{oo} 



/ ^oo; (00) i^oo; 

e 5iy_ 7+2 ,{oo}' " ' ' »0,{oo}'i'l,{oo}' 

E) 1 (°°) 

^5\^oo,{oo};'*' 5 ^00, {00}; 2/2,(00}'' 



,(00) 



A (OO) (OO) \ 

eg(xr--^x,y 2Aoo} ,--- ,2/ 7i{oo} )- 



• u (oo) ) 
5 "7, {00} > 

y(°°) ) 

' "7,(00} / 



Hence, if we define to as follows 



' - _ (0°) 
x — V\, {00}' 

,.(<») 

k »i,{oo}' 



i < 1 
i > 1, 



g 
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then it is clear that u = 7r(cj) = f2(cj). 

Note that any fixed point & of the function Q(-) has to 
satisfy lu >z yf^ for the same channel erasure rate e g [0, 1] 

from the monotonicity of g(-). In particular, u >z J/ju • F rom 
the continuity of the DE equations in Definition |4] it follows 
that u is the least solution to the equation y = Q(y), since it 
is the limiting constellation of the sequence of non-decreasing 
constellations {j/fe }j£L].. ■ 

We defer the proof of the following proposition to Appendix 
III] which is the central argument in the proof of Theorem 
8] Using the bound on the maximum transition width from 



Proposition 14 we obtain an upper bound on x for a given 
window size W and erasure rate e£#, From this, we arrive 
at a lower bound for e that guarantees x < 8 when 8 is an 
arbitrarily chosen value smaller than 8* (which depends only 
on di, d r ) and the window size is larger than W m i n (8) (which 
depends only on the code parameters di,d r ,"f and 8). This 
gives us our lower bound on the WD threshold. 

Proposition 16 (WD & FW Thresholds): Consider WD of 
the (di,d r ,j,L) spatially coupled ensemble with a window 
of size W > W min (8) = W min {8) + 7 - 1 over a BEC with 
erasure rate e. Then, we have 



WD 



(d h d r , 7 ,W,8)>(l-^S^) 



x e*™(d u d r , 7 ,W -7+1,5) 

provided 8 < 5* = (3^7) ^ , where e FW (d h d r , 7, W, 8) is 
the first window threshold. ■ 



From Proposition 16 and Equation (l9j, we immediately have 
that 

\di,d r ,~,,W,8)>(l-^8^) 



/WD/ 



(e BP WA,7) 



.l ( W^+±_ Alnln D_ c) 



provided W > W m i n (8). By making the substitution C = 
C + 1, we see that this proves Theorem [8] 

V. Experimental Results 

In this section, we give results obtained by simulating 
windowed decoding of finite-length spatially coupled codes 
over the binary erasure channel. The code used for simulation 
was generated randomly by fixing the parameters M = 1024, 
di = 3,d r = 6, with coupling length 7 = 3 and chain 
length L = 64. The blocklength of the code was hence 
n = ML = 65, 536 and the rate was R sa 0.484375. From 
Table [I] the BP threshold for the ensemble to which this code 
belongs is e BP (d l = 3, d r = 6,7 = 3, L = 64) » 0.487514. 

Fig.|2]shows the bit erasure rates achieved by using windows 
of length W = 4, 6, 8, i.e., the number of bits within each 
window was WM — 4096, 6144 and 8192 respectively. From 
the figure, it is clear that good performance can be obtained 
for a wide range of channel erasure rates even for small 
window lengths, e.g., W = 6, 8. In performing the simulations 
above, we let the decoders (BP and WD) run for as many 





k W = 4 


■ W = 6 




• W = 8 


t BP 




Fig. 2. Bit erasure probability of the (d; = 3,d r = 6,7 = 3,L = 64) 
spatially coupled code with M = 1024 achieved with a windowed decoder 
of window sizes W = 4, 6 and 8. 



iterations as possible, until the decoder could solve for no 
further bits. For the windowed decoder, this meant that within 
each window configuration, the decoder was allowed to run 
until it could solve no further bits within the window. Fig. 
[3] plots the average number of iterations for the BP decoder 
and the average number of iterations within each window 
configuration times the chain length (which corresponds to the 
average number of iterations) for the WD. We can see that for 





Fig. 3. Average number of iterations {£) for BP and WD as a function of 
the channel erasure rate is shown for each window size in solid lines. For 
the WD, we show in dashed lines, the average number of iterations required 
within each window configuration. 
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randomly chosen spatially coupled codes, a modest reduction 
in complexity is possible by using the windowed decoder 
in the waterfall region. Interestingly, the average number of 
iterations required per window configuration is independent 
of the chain length below certain channel erasure rates. The 
number of iterations required decreases beyond a certain value 
of e because for these higher erasure rates, the decoder is no 
longer able to decode and gets stuck quickly. Although the 
smaller window sizes have a large reduction in complexity and 
a decent BER performance, the block erasure rate performance 
can be fairly bad, e.g., for the window of size 4, the block 
erasure rate was 1 in the range of erasure rates considered in 
Fig. [2] However, the block erasure rate improves drastically 
with increasing window size — for the window of size 8, the 
block erasure rate at e = 0.44 was w 6.3 x 10~ 4 . 

The above illustration suggests that for good performance 
with reduced complexity via windowed decoding, careful code 
design is necessary. For a certain variety of spatially coupled 
codes — protograph-based LDPC convolutional codes — certain 
design rules for good performance with windowed decoding 
were given in [28| , and ensembles with good performance 
for a wide range of window sizes (including window sizes as 
small as 7) over erasure channels with and without memory 
were constructed. For these codes constructed using PEG 
pT) and ACE J32[ techniques, not only can the error floor 
be lowered but also the performance of a medium-sized 
windowed decoder with fixed number of iterations can be 
made to be very close to that of the BP decoder [28]. It is for 
such codes that the windowed decoder is able to attain very 
good performance with significant reduction in complexity and 
decoding latency. 

VI. Conclusions 

We considered a windowed decoding (WD) scheme for 
decoding spatially coupled codes that has smaller complexity 
and latency than the BP decoder. We analyzed the asymp- 
totic performance limits of such a scheme by defining WD 
thresholds for meeting target erasure rates. We gave a lower 
bound on the WD thresholds and showed that these thresh- 
olds are guaranteed to approach the BP threshold for the 
spatially coupled code at least exponentially in the window 
size. Through density evolution, we showed that, in fact, the 
WD thresholds approach the BP threshold much faster than 
is guaranteed by the lower bound proved analytically. Since 
the BP thresholds for spatially coupled codes are themselves 
close to the MAP threshold, WD gives us an efficient way 
to trade off complexity and latency for decoding performance 
approaching the optimal MAP performance. Since the MAP 
decoder is capacity-achieving as the degrees of variables 
and checks are increased, similar performance is achievable 
through a WD scheme for a target erasure floor. 

Through simulations, we showed that WD is a viable 
scheme for decoding finite-length spatially coupled codes 
and that even for small window sizes, good performance 
is attainable for a wide range of channel erasure rates. 
However, the complexity reduction for randomly constructed 
spatially coupled codes is not as significant as that obtained 



for protograph-based LDPC convolutional codes with a large 
girth. Thus, characterizing good spatially coupled codes within 
the ensemble of randomly coupled codes is a question that 
remains. 

The WD scheme was analyzed here for the BEC and, 
therefore, the superior performance of these codes and the low 
complexity and latency of the WD scheme make these attrac- 
tive for applications in coding over upper layers of the internet 
protocol. Furthermore, the same scheme can be employed for 
decoding spatially coupled codes over any channel. However, 
for channels that introduce errors apart from erasures, the WD 
scheme can suffer from error propagation. This effect would be 
similar to what occurs in decoding convolutional codes using 
a Viterbi decoder with a fixed traceback length. Analysis of 
the WD scheme and providing performance guarantees over 
such channels will play a key role in making spatially coupled 
codes and the WD scheme practical. 

Appendix I 
Proof of Lemma[To1 

For the lower bound, we have 

Vi = eg(Vi—y+l, ■ ■ ■ ,Vi+y-l) 
> eg( Q, ■ ■■ , , j/i+i , ■ ■ ■ ,yi+i ) 
7 7-1 

where (a) follows from the fact jT9] Lem. 24(iii)] that 

sG/i-T+i) • ■ • ) 2/1+7-1) ^ vt' 1 

where y t = 4f £j,fcio Vi+j-k- Applying this bound recur- 
sively for Vi,y i+ i, • • ■ , Vi+j-i, we get 

m > (e(^)*- 1 ) i4l * i 2/|^ 1) ' = *e-*i«-^ 

-1 

where $ = (e(^) d,_1 ) "'^ > 1 and ^ = In (^j > 0. 
When i + j = W + 1, since yw+i = 1, 

w > $ e -«*-i) w+I -\ i = l,2,-..,W 

with 4> — <fiw+i-i = ln<fr. 
For the upper bound, 

Vi < eg( y i+k ,-- ,y i+k , l,-- , 1) 

7 — fc— 1 

< e(l-a k (l-y i+k ) d '- 1 ) 
- fk(e,Vi+k) 

for k g [0,7-1], where a k = (1- (7-fe-i)(7-fc) ^,-1^ Here 
(b) follows from (19) Lem. 24(i)] 

<?(yi- 7 +i, • • • , y l+7 -i) < (1 - (1 - ft)*- 1 )*- 1 . 

Note that for k = 7— 1, / 7 _i(e, x) = /(e, x), the forward DE 
update equation for the (c?;, d r ) -regular ensemble. This proves 
the Lemma. We now discuss the utility and limitations of the 
upper bounds derived here. 
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Fig. [4] plots the bounds /fc(e, j/i+fc) for the (di = 3, d r — 
6, 7 = 3, L) ensemble for two values of e, one below and the 
other above the BP threshold e BP (di, d r ). As is clear from the 




0.15 0.3 

Ui+k 



0.45 



Fig. 4. The upper bounds fk{ e ^Ui+k) f° r two values of e : ej = 0.3 < 
e BP (di,d r ) ss 0.4294 < e 2 = 0.45 for the (dj = 3,d r = 6,7 = 3,L) 
ensemble. 

figure, the tightest bounds are obtained for k = 7— 1. Note that 
the bound when k = can be recursively computed to obtain 
a universal upper bound y u ^ on all the window constellation 
points yi for a given (di, d r , 7, L) ensemble, given by the fixed 
point of the equation 

,7 + 1, • - • -\ di- 1 



y = fofay) 



2 7 



d r -l 



which is plotted in Fig. [5] As can be seen from the plot, 
these upper bounds are only marginally tighter than the trivial 
upper bound of e. In general, we can write yw < y u b 
and use the other upper bounds /&(-,•) to obtain better 
bounds for other sections as follows. In the sequel, we shall 



write fk u k 2 ,-,k a (e,y) to denote / fel (e,/ fc2 (e, 
and similarly define 



/fc<=(e,y) - fk(e,fk{ 



Jk(e,y))) 



Thus, for j = c(j — 1) + d, < c, < d < 7 — 1, we can 
write yi < fd.(j-iy ( e , Vi+j)- The FP value of the erasure 
probability of a variable node in the first section, yi, can 
therefore be bounded in terms of the window size W as 



yi < /d,( 7 -i)=(e,2/ub), 



where c 



w-i 

7-1 



= W — 1 — 0(7 — 1). This bounding 
is particularly useful when e < e BP (di,d r ) when the fixed 



2/ub 




0.2 0.4 0.6 



Fig. 5. Universal upper bounds y u ^ on the constellation points yi as a 
function of e for the (d; = 3, d r = 6, 7 = 2, L) ensemble. These bounds 
are only marginally tighter than the straightforward upper bound e. Also, the 
bounds are non-decreasing in 7. 



point of the / 7 _i(-,-) upper bound is zero. It is sometimes 
possible that / (7 _ 1) c(e,y ub ) < /d,( 7 -i)= (e, y u b), in which 
case we can retain the tighter upper bound /(_,_]> (e, y u b)- Fig. 
[6] shows an example of the upper bound on y-y graphically. As 




Fig. 6. Upper bound, y\ > yi, for the (di = 3, d r = 6, 7 = 3, L) ensemble 
with a window of size W = 9. The channel erasure rate e = 0.3. Note that 
c = 4, d = 0, y ub ss 0.285464 and y x = (0.3, y ub ) ~ 0.0343947. 

a consequence of this upper bound, as W — > 00, we have that 
yi — > for e < e BP (<i ; ,d r ). However, for e > e BF (di,d r ), 
these upper bounds are not very useful since the FP of the 
/ 7 _x(-, •) upper bound is non-zero (cf. Fig. |4ji. 

Appendix II 
Proof of Proposition[14] 



In the following, we will use some results from |19| 
summarized below. We define 



Kv) - /(e>y) - V 



where 



f(e,y)=e(l-(l-yy^) 



l\d,-l 



the DE update equation for randomized (di, d r ) -regular en- 
sembles. For e € (e BF (di, d r ), 1), the equation h(y) = 
has exactly three roots in the interval [0, 1], given by Q,y u (e) 
and y s (e). Between and y u (e), h(y) is negative, attaining a 
unique minimum at y m i n (e). Between y u (e) and y s (e), h(y) 
is positive, attaining a unique maximum at y max (e). Beyond 
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y s (e)> h(y) is negative again. Between and y mm (e), h(y) is 
upper bounded by a line through the origin with slope 



and 



-MiO) 



ftfamin(e)) 
ymin(e) 



i.e., the line %) = -fii(e)y. Between y min (e) and y u (e), 
h(y) is upper bounded by a line passing through (y u (e),0) 
with a slope 

/i 2 (e) = mm{— — -—,h (y„(e))}. 

2/«(e) - 2/ m in(e) 

Between j/«(e) and y max (e), h(y) is lower bounded by a line 
through (y u (e),0) with a slope 

^ 3 (e =min{ -—,h (y„(e))}. 

2/ max (e) ~ y«(e) 

Between y max (e) and y s (e), /i(y) is lower bounded by the line 
through (y s (e),0) with slope 

-fii{e) = max (y s e ))}. 

Vsiq - 2/max(e) 

Beyond y s (e), h(y) is upper bounded by the line through 
(y s (e),0) with slope 

-/U 5 (e) = ti(y s (e)). 

Each of the /ij(e)'s, i = 1, •■• ,5, defined above is strictly 
positive for e in the specified range. For a general e, we will 
drop the dependence of each of these parameters on e from 
the notation. When e = e* = e MAP (d;, c? r ), the corresponding 
parameters are themselves shown with *'s. These properties 
of h(y) are illustrated in Fig. [7] We can lower bound j/ m i n 




Fig. 7. Plot of h(y) (in solid blue) for the (d; = 3, d r = 6) ensemble 
for e = 0.47 illustrating the properties stated above. We have dropped the 
dependence of all the parameters on e from the notation. The tangents at 0, y u 
and y s are shown as dashed red lines. The other lines used in bounding h(y) 
are shown as dash-dotted green lines. The /Uj's, i = 1, • • • , 5, are shown in 
the same color as the lines, whose absolute values of slopes they represent, 
that bound h(y) in various regions. 

as y m in > -pji, and the slope m as m > = fa V e e 
(e BP (d h d r )!l). 

Further, \h'(y)\ < d t d r V y G [0,1]. We have ft'(0) = 
ft'(l) = -1, h'(y min ) = h'(y max ) = 0. h"(0) = h"(y) = 
h"(l) = 0, where 



y = i 



did. 



d r 



h"(y) 



>0, ye(0,y) 
<0, ye(y,l)- 



From Rolle's Theorem, y m j n < y < y max . We first give some 
simple bounds for the /i/s defined earlier which will be useful 
in the proof. 

Lemma 17 (^4,(15 bounds): For e G (e BP (dj, d r ), 1), we 
have < Hi < H5 < 1- 

Proof: Since < for y G (y max , 1), 

monotonically decreases in this interval. Thus, < /X5 = 
—h'(y s ) < 1. From the mean value theorem, we have 

= ~ ys-V™L - h '(y») fOT SOme £ G [ymax,?/ S ] SO that 

< M4 = < -&'(».) = Ws < 1. ■ 

The values y M and y s are referred to as the unstable 
and stable fixed points (FPs) of DE for the [di, d r ) -regular 
ensemble, respectively. This is because both these values 
satisfy h(y) = or y = f(e,y) = e(l - (1 - y) dr 
The e for which the FP is y G [0, 1] is given by 

V 



-l\di 



(1 - (1 - 



The BP threshold is hence the smallest value of e(y), 



{d u d r 



min{e(i/),y G [0,1]}. The value of y 



that achieves this minimum is denoted y . Then, V e G 
[e BP (d/, d r ), 1], the unstable and stable FPs are given by 



W»(e)=l/G [0,y BP ] :e(y)=e, 
».(£) = »€ [y BP ,l] :e(i/)=e. (10) 
Fig. |8] plots these stable and unstable FPs. The reason why 




0.25 0.5 0.75 



Fig. 8. The unstable and stable FPs of DE for the (d; = 3, d r = 6)- 
regular ensemble as given by Equation \W\ . is shown as the thick 

blue curve and y s (e) as the thin red curve. By definition, y s (e) > y BP and 



Vu(e) < y BP 
that j/ s (e BP ) 



y BP and e BP = e BP (d ; 
= tf BP =w , (6 BP ) . 



3, d r = 6) are also shown. Note 



y s is called the stable FP (and y u the unstable FP) can be 
explained through Fig. [8] For e G (e BP (di, d r ), 1], when the 
forward DE updates are performed, the value y monotonically 
decreases from 1 and converges to the first solution of the 
equation h(y) = 0, which happens to be y s {t) for e in this 
range. Therefore, performing BP always results in the FP y s 
and hence the adjective "stable". Similarly for y u , which is a 
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solution never reached through BP, it can be shown that a small 
perturbation from the value of y u will result in convergence 
to either y s or 0. Therefore, y u 's are "unstable" FPs. 

We can define the derivatives y' s and y' u of y s and y u , 
respectively, with respect to e for e G (e BP (d/, d r ), 1). It is 
easy to see that y' s is monotonically decreasing and y' u is 
monotonically increasing in e. For details and proofs of the 
aforementioned properties, see fl9] Appendix II], [33) . 

We are now ready to prove the proposition. Note that when 
W is smaller than the claimed upper bound on the transition 
width, the claim is trivially true; i.e., the transition width 
cannot be longer than the window size. However, in this case, 
we cannot guarantee yi < 6. Hence, we will assume that W 
is larger than the bound. In the following, we will often use 
the bound 

/(e,j/i_ 7+ i) < yi < f(e,y i+1 -i). 

We now define a schedule that results in a FP window 
constellation that dominates the FP of the parallel schedule, 
y, for a channel erasure rate e G § . We then upper bound 
the actual transition width by the transition width of the 
dominating FP. We generate the dominating FP in steps, 
i) Set j/W = 1 and evaluate the sequence of window 
constellations {y^} according to Equation ((6j, but with 
the boundary conditions 




We have the FP in this case, y A , satisfying y A y y by 
induction. Further, 



eg(y A ,--- ,y A ,yi,--- ,J/ 7 ) 



(11) 



>eg{y?,- 



,vt) 



f(e,y A ), 



27-1 



so that yf > y s . Note that yf < y u cannot happen 
since, starting from 1, yf will equal the first solution 
of ( fTT| , which from the continuity of the DE equations is 
guaranteed to have a solution no smaller than y s . Starting 
from the right end, we now count the number of sections 
until yf < y s +TA where we choose = Va ~ Vs . Recall 
that the *-ed values correspond to e* = e MAP (c?;, d r ). We 
first observe that 

Vw-y+i < tg(Vw> ■ • • ) Vw) = /(e. Vw)- 

Hence, 

Vw ~ Vw-i+i — Vw ~ Z( e ' Vw) 

= -Hvw) > ^{Vw ~ Vs), 

which implies that y w _ y+1 - y s < (1 - ^h){Vw ~ Vs)- 
From similar reasoning, we can show that 



Vw- 



»( 7 -l) - Vs < (1 - M5)(j/^-( m -l)( 7 -l) - Vs) 

< (1 - tisTiVw ~ Vs) 

< (l-M5) m (2/ub-2/ s ). 



Since ^5 < 1 from Lemma [T7] the above difference is 
decreasing in m. From the definition of y u b (note that this 
upper bound is valid even for the boundary conditions 
specified here) in the proof of Lemma 10 it is easy to 



see that y u ^ > y s so that the right hand side of the above 
chain of inequalities is non-negative. Thus, y w _ m ^_ 1 - ) < 

y s + T A if 

m > 



In 



Let //5 = min{/i5, e G $ }. Then, we can write from the 
mean value theorem 



T. = 



Vs - Vs 



z -v' s (e) 



2 2 

for some e G [e, e*]. We can lower bound this as 

Ae 

T 



-y s 



> 



l y' s (e*) 



where the first inequality follows from the fact that y' s 
is decreasing in e in the interval (e BP (d;, d r ), 1) and the 
> e BP (di, d r , 7). Therefore this width is 



second from e* 
no more than 



(7-l)( 



In 



2(y ab -v BP ) 

y^(e*)Ae 



111 



1-Ab 



sections, since y BP < y s . 
ii) From the definition of T^, we have y s 



■r A = y* 3 -r 



A- 

Let %a be the largest index for which yf- < y* s — T^. 
Set y(°) = y A and evaluate the sequence of window con- 
stellations {y^} according to Equation (|6]l performing 
the updates only for those sections with indices i < i A . 
Further, perform the updates for the channel erasure rate 
e* since we only require an upper bound on the transition 
width. We set the left end of the window to perform these 
updates to 0, i.e., yi = V i < 0. Let y B denote the 
FP window constellation at the end of this procedure. 
By induction, we have yf<y* s — T a V i < %a- Also, 
l£_ 1 >/(e*,tf&_ 7 )so1faat 



y? A -i - Vi A —f > HviA-j) > f4(y* s - vf A 



-■yj 
> 



where the last inequality assumes that yf A - 1 ^ Vmaz 
Note that there is no loss of generality in this assumption, 
for if it were not true, we have that the number of sections 
with FP values yf between t/^ ax and y* is smaller than 
the upper bound we derive in the following. The above 
inequality implies that 

y* - y B > V ° - V ^- X 
Vs </^- 7 - 1 _ M * ' 

Similarly, it can be shown that as long as 

n B > 11* 

y%A-i— 771(7— 1) — Wmax' 

* _ B 

* _ B > Vs _ ^A-l-(m-l)( 7 -l) 

Vs Vi A -i- m (i-l) — 1 ,,* 

1 P4 

and by induction 

* b ^ y*s — y?A-i ^ ^ a 

Vs 2/i A -l-m(7-l) — 



> 



(1 - " (1 - fM* 4 y 
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Note that since [i\ < 1 from Lemma 17 the above 
difference is increasing in m. Thus, there are no more 
than 

m 2(a:-C*) 

(7-Df * ( -; )A ' +i 



In 



-><i 



sections with yf £ [y * lax ,y*]. 
iii) Let z max be the largest index i such that yf 



<v* -We 



define Tg = y " J 1 " and count the number of sections 
with FP values yf between y* + Tb and y^ ax . Since 



-(7-1)) We haVe 

. (7 _i) > Kvt^-i)) 



where we again assume without loss of generality that 

y imax-(7-l) 



> y*. The above inequality implies that 

i) 



2/imax-(7 



y« < 



y« 



i 



M5 



Again by induction, 



< 



(1 + A*a)* 



as long as y^ ax __ m(7 _i) > Since /x| > 0, the 
above difference is decreasing in to, and consequently, 



Vi, 



-771(7-!) 



< 



T B if 



Writing 



T, 



TO > 



Vu 



-y u 



ln(l + ^) 



2 2 

from the mean value theorem for some e E [e, e*], we can 
bound this as 

where the first inequality follows because — y^ is decreas- 
ing in e in the interval (e BP (d;, d r ), 1) and the second 
because e* > e BP (di,d r ,^f). This implies that there are 
no more than 



(7-1) 



In 



1 



ln(l + /i*) 

sections with FP values between y* + and y max . 
iv) From the definition of T b , we have y * + T b = Vu — ■ 
Let is be the largest index i such that y^ < y u — Tb- 



Set ?/ 



(o) _ 



y 



and evaluate {y^>} according to Equation 



|6) performing the updates only for sections with indices 
i <ib with channel erasure rate e. Again we set the left 
end of the window to while performing the updates. 
Denote the FP obtained at the end of this procedure as 
y c . Clearly, yf < y a - T B V % < i B - Since yf B _^ +l < 
f(^vf B )> we have 



y, 



v? B - 1+ i > -Hy? B ) > M 2 (y u - yf B ) 



so that 



Vu - yi B -j+i > (i + M2)(yn - y %B )- 



Here, we assume that yf > y m i n in order to obtain 
an upper bound on the number of sections in the range 
[yraimVu]- From similar reasoning as above, as long as 



y, 



1(7-!) — 2/min, 



yu - yf B _ m{l _x) > (l + ^2){yu - y^-(m-i)( 7 -i)) 

and by induction 

Vu - y? B - m (-y-i) > (! + ^) m {y u - yf B ) 

> (l + M 2) m T B . 

Since // 2 > 0, the above difference is increasing in m. 
By letting fa — min{/i2, e G #} and noting that 



y u - ymin < y BP - ymin < y BP 

we have that there are no more than 



1 



dfdr 



(7-1) 



ln(l+/2 2 ) 



1 



sections with FP values in the interval [y m in iVu]- 
v) Let ic be the largest index i such that yf < y n 
Proceeding as above, we have 



y 



*C— 7+1 



and by induction, 



y? c - mh -i) <(i-Mi) m y: 



m„,C 



Thus, between <5o and y mm , there are no more than 

BP 

(7-l)(j-t + l) 



sections with FP values in the interval [<5o>ymm], since 
y m in < y BP and fx\ > p,\. 
vi) Let ijj be the largest index i such that yf < 



So- From Lemma 11 we know that the tail decays 
doubly-exponentially for i < id- From ([8]), we have 

y c < ^g-vw-ir < y e -4>(di-i) m where 



< 



5 ^{d u d rn ) + e^(d l ,d r ) y ^ ^ - 



and 



1 , 1 1 

w = In — > 

di - 2 e ~ di - '. 

Thus there are no more than 



In 



MAP 



(di,d r ) 



4f 



(7-l)(j 



vln(d, - 1) 
sections with g [5, <5q]. 



ln ln 



In 4 



<5 ln(d, - 1) 



14 



SUBMITTED TO THE IEEE TRANSACTIONS ON INFORMATION THEORY, MARCH 2012 



ln(d, 



In J 



In 



ln(l 



-a4(e*) 
M2) 



In 



In 



In 



In 



(12) 



Finally, collecting all these terms, we conclude that the tran- 
sition width of the FP obtained from the procedure highlighted 
in the steps [I]) through |yi]i is upper bounded by 



% € ,S) = ( 7 -l)(Alnln^ + Bln^- + C 



where the constants A, B and D are as follows 

1 



bx(dl - !) ' 
1 1 1 



M1 + A2) in(i + /4) Hi^*) Hj^) 1 

D = 

The constant C is as given in Equation ( p~2] >. Note that these 
constants depend only on the ensemble parameters di , d r and 
7. Since it is clear that the FP obtained through the procedure 
in steps [TJ> through |vi| above dominates pointwise the first 
window configuration FP of forward DE with a window of 
size W for channel erasure rate e, we can guarantee that the 
transition width is upper bounded by the above expression. 
This completes the proof. 

Appendix III 
Proof of Proposition[T61 

We start with the first window configuration FP of forward 
DE when the channel erasure rate is e and show that this FP 
dominates the c th window configuration FP of forward DE 
for every c for a smaller channel erasure rate v < e. To 
prove this, it suffices to show that the FP to defined in Lemma 
15 for channel erasure rate v is dominated pointwise by the 
first window configuration FP for channel erasure rate e. This 
establishes v as being a lower bound on the WD threshold 



Set = y 1 ^, the first window configuration FP of 
fc 

according to 



forward DE for channel erasure rate e. Evaluate {y^'}^° 



1=1 



where Q(-) is as defined in Lemma [TB] but for channel erasure 
rate v — e — Ae. Then, the following are true: 



U 

and 

yl 1] 



(0) 



< 



eg(0,--- ,0,yi U \--- 1 <7 



7 — 2 



vg(yf\--- ,y { °\y ( i\--- ,2/|+ 7 _i)> 1 < * < 7 



7— t 



7 <i<W. 



For 7 < i < W, 
V 



(i) _ H„ (°) = e ~ Ae „(°) < ,,(°) 
e 6 — 



(13) 



Let us write 

3i(^yi, • • • ,2/4+7-1) = 9{°,--- )0-,yi) • • • ,2/4+7-1) 

and 

Gi(e,a,yi, ■ ■ ■ ,2/4+7-x) = egifavu 1 ' ■ ,2/4+7-1) 
for 1 < i < 7. For i in this range, consider 

AO) _ (i) _ G ./ o (0) ... (o) v 

i+7- 



^i\P-,y\ ,2/1 , • • • ,2/i+7-iJ 



-Gi^.O,^ 05 ,... ,2/l+Vx) 



Gi(u,2/x 0) ,2/x 0) ) ' ' ' >2/i+^y-x) 
-G i ( W ,0,2/f ) ,---,2/|Vi) 



,(0) 



ae 



,(0)1 



Vi 



(0) 



SGi 
5a 



(14) 



where £ g [i>,e] and £ € [0,yi ]. Here, (a) follows from the 
mean value theorem. We have 



9G 
de 

Since 



: 9i{v,yi, ■ ■ ■ ,2/4+7-1) 



(0) 



e|^f, we focus on g^. Expanding out the 



expression for g t , it can be written as 



-(K,i 



7~* \d 



a)*- 1 + 



1 

— ( 

7 



1 / *" — \cL — 1 1 d r — l 

+ (a li7 _i - -a) +a iri _ l+1 



d r -l 



di-1 



where 



Oti,j+l 



1 - 7 Ecii 1/0, < j < 7 - * - 1 

1 - ^ Eciiki-7+1 2/c, 7-*<i<7-!- 



Clearly, < a iJ+1 < 1 V 1 < i < 7,0 < j < 7 - 1. 
Therefore, 



% = (rf t -l)(d r -l) 
9a 7 3l 



cr) 



+ ••• + (Qu 7 _ 4 - -cr) d " 2 - 



7 



7 



a^r(7-*)(7-*+l) . di^r a£? 
< 7^5, 2 < ^ft • 
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Here, (a) holds because < (ctfjj+i — 
j < 7 — i — 1. This implies that 



7-1-3 
7 



er) < 1 V < 



< o- e 



X,v m 



didr 



W d[d r _t 

< v d i~ 

~ 2 



(0) 



z/ (0) ) 



(0) 



(0) (0) 



x ("5(2/1 
d\d r 



w (0) ^ ' 



7 ^ 



("1 ^4 

~ 2 



,,(0) ^i" 1 



Here, the inequality labeled (b) is true because £ < yj -*, (c) 
follows from the observation that y^ < > 1, which 

is in turn true since j/v°) and Tr(y^) were non-decreasing. 
Substituting back in we have for 1 < i < 7 



(0) 



(0) 

(!) xA K 

2/> > Ae 



(0) didj 



ll (0) d '- 1 

y 7 



Thus if ^ > ^y^ 1 " 
7r(i/ ') dominates pointwise. Recall that 



> 2/i V i > 1, and hence 




It therefore follows by induction that the limiting constellation 
y{°°) exists, and is also dominated by n(y(°'). It is clear that 
yipo) satisfies 



yM = tt(?/ (oo) )- 



From Lemma [15] ^ a; and hence y[°°^ > x. 

If the window size is chosen to be W > W m i n (5) + j— 1 = 
W m i n (5), then for the first window, we can guarantee y^ < 5 
for some 5 < 5 for all channel erasure rates smaller than 



C FW 



e (di,d r , 7, W—j+1, S). From the above argument, 
it follows that we can ensure x < S for all erasure rates smaller 
than e FW fl 



6<5± 



2 

d[d r 



As long as 
1 

< 



= 5, 



0) 



.d r - 1, 

this erasure rate is a non-trivial lower bound on the WD 
threshold e WD . 
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